Riemann–Stieltjes integral

From HandWiki
Short description: Generalization of the Riemann integral

In mathematics, the Riemann–Stieltjes integral is a generalization of the Riemann integral, named after Bernhard Riemann and Thomas Joannes Stieltjes. The definition of this integral was first published in 1894 by Stieltjes.[1] It serves as an instructive and useful precursor of the Lebesgue integral, and an invaluable tool in unifying equivalent forms of statistical theorems that apply to discrete and continuous probability.

Formal definition

The Riemann–Stieltjes integral of a real-valued function [math]\displaystyle{ f }[/math] of a real variable on the interval [math]\displaystyle{ [a,b] }[/math] with respect to another real-to-real function [math]\displaystyle{ g }[/math] is denoted by

[math]\displaystyle{ \int_{x=a}^b f(x) \, \mathrm{d}g(x). }[/math]

Its definition uses a sequence of partitions [math]\displaystyle{ P }[/math] of the interval [math]\displaystyle{ [a,b] }[/math]

[math]\displaystyle{ P=\{ a = x_0 \lt x_1 \lt \cdots \lt x_n = b\}. }[/math]

The integral, then, is defined to be the limit, as the mesh (the length of the longest subinterval) of the partitions approaches [math]\displaystyle{ 0 }[/math], of the approximating sum

[math]\displaystyle{ S(P,f,g) = \sum_{i=0}^{n-1} f(c_i)\left[ g(x_{i+1}) - g(x_i) \right] }[/math]

where [math]\displaystyle{ c_i }[/math] is in the [math]\displaystyle{ i }[/math]-th subinterval [math]\displaystyle{ [x_i;x_{i+1}] }[/math]. The two functions [math]\displaystyle{ f }[/math] and [math]\displaystyle{ g }[/math] are respectively called the integrand and the integrator. Typically [math]\displaystyle{ g }[/math] is taken to be monotone (or at least of bounded variation) and right-semicontinuous (however this last is essentially convention). We specifically do not require [math]\displaystyle{ g }[/math] to be continuous, which allows for integrals that have point mass terms.

The "limit" is here understood to be a number A (the value of the Riemann–Stieltjes integral) such that for every ε > 0, there exists δ > 0 such that for every partition P with norm(P) < δ, and for every choice of points ci in [xixi+1],

[math]\displaystyle{ |S(P,f,g)-A| \lt \varepsilon. \, }[/math]

Properties

The Riemann–Stieltjes integral admits integration by parts in the form

[math]\displaystyle{ \int_a^b f(x) \, \mathrm{d}g(x)=f(b)g(b)-f(a)g(a)-\int_a^b g(x) \, \mathrm{d}f(x) }[/math]

and the existence of either integral implies the existence of the other.[2]

On the other hand, a classical result[3] shows that the integral is well-defined if f is α-Hölder continuous and g is β-Hölder continuous with α + β > 1 .

If [math]\displaystyle{ f(x) }[/math] is bounded on [math]\displaystyle{ [a,b] }[/math], [math]\displaystyle{ g(x) }[/math] increases monotonically, and [math]\displaystyle{ g'(x) }[/math] is Riemann integrable, then the Riemann–Stieltjes integral is related to the Riemann integral by [math]\displaystyle{ \int_a^b f(x) \, \mathrm{d}g(x) = \int_a^b f(x) g'(x) \, \mathrm{d}x }[/math]

For a step function [math]\displaystyle{ g(x) = \begin{cases} 0 & \text{if } x \leq s \\ 1 & \text{if } x \gt s \\ \end{cases} }[/math] where [math]\displaystyle{ a \lt s \lt b }[/math], if [math]\displaystyle{ f }[/math] is continuous at [math]\displaystyle{ s }[/math], then [math]\displaystyle{ \int_a^b f(x) \, \mathrm{d}g(x) = f(s) }[/math]

Application to probability theory

If g is the cumulative probability distribution function of a random variable X that has a probability density function with respect to Lebesgue measure, and f is any function for which the expected value [math]\displaystyle{ \operatorname{E}\left[\,\left|f(X)\right|\,\right] }[/math] is finite, then the probability density function of X is the derivative of g and we have

[math]\displaystyle{ \operatorname{E}[f(X)]=\int_{-\infty}^\infty f(x)g'(x)\,\mathrm{d}x. }[/math]

But this formula does not work if X does not have a probability density function with respect to Lebesgue measure. In particular, it does not work if the distribution of X is discrete (i.e., all of the probability is accounted for by point-masses), and even if the cumulative distribution function g is continuous, it does not work if g fails to be absolutely continuous (again, the Cantor function may serve as an example of this failure). But the identity

[math]\displaystyle{ \operatorname{E}[f(X)]=\int_{-\infty}^\infty f(x)\, \mathrm{d}g(x) }[/math]

holds if g is any cumulative probability distribution function on the real line, no matter how ill-behaved. In particular, no matter how ill-behaved the cumulative distribution function g of a random variable X, if the moment E(Xn) exists, then it is equal to

[math]\displaystyle{ \operatorname{E}\left[X^n\right] = \int_{-\infty}^\infty x^n\,\mathrm{d}g(x). }[/math]

Application to functional analysis

The Riemann–Stieltjes integral appears in the original formulation of F. Riesz's theorem which represents the dual space of the Banach space C[a,b] of continuous functions in an interval [a,b] as Riemann–Stieltjes integrals against functions of bounded variation. Later, that theorem was reformulated in terms of measures.

The Riemann–Stieltjes integral also appears in the formulation of the spectral theorem for (non-compact) self-adjoint (or more generally, normal) operators in a Hilbert space. In this theorem, the integral is considered with respect to a spectral family of projections.[4]

Existence of the integral

The best simple existence theorem states that if f is continuous and g is of bounded variation on [a, b], then the integral exists.[5][6][7] Because of the integration by part formula, the integral exists also if the condition on f and g are inversed, that is, if f is of bounded variation and g is continuous.

A function g is of bounded variation if and only if it is the difference between two (bounded) monotone functions. If g is not of bounded variation, then there will be continuous functions which cannot be integrated with respect to g. In general, the integral is not well-defined if f and g share any points of discontinuity, but there are other cases as well.

Geometric interpretation

A 3D plot, with [math]\displaystyle{ x }[/math], [math]\displaystyle{ f(x) }[/math], and [math]\displaystyle{ g(x) }[/math] all along orthogonal axes, leads to a geometric interpretation of the Riemann–Stieltjes integral.[8]

The basic geometry of the Riemann-Stieljes integral.

If the [math]\displaystyle{ g }[/math]-[math]\displaystyle{ x }[/math] plane is horizontal and the [math]\displaystyle{ f }[/math]-direction is pointing upward, then the surface to be considered is like a curved fence. The fence follows the curve traced by [math]\displaystyle{ g(x) }[/math], and the height of the fence is given by [math]\displaystyle{ f(x) }[/math]. The fence is the section of the [math]\displaystyle{ g }[/math]-sheet (i.e., the [math]\displaystyle{ g }[/math] curve extended along the [math]\displaystyle{ f }[/math] axis) that is bounded between the [math]\displaystyle{ g }[/math]-[math]\displaystyle{ x }[/math] plane and the [math]\displaystyle{ f }[/math]-sheet. The Riemann-Stieljes integral is the area of the projection of this fence onto the [math]\displaystyle{ f }[/math]-[math]\displaystyle{ g }[/math] plane — in effect, its "shadow". The slope of [math]\displaystyle{ g(x) }[/math] weights the area of the projection. The values of [math]\displaystyle{ x }[/math] for which [math]\displaystyle{ g(x) }[/math] has the steepest slope [math]\displaystyle{ g'(x) }[/math] correspond to regions of the fence with the greater projection and thereby carry the most weight in the integral.

The effects of curvature in [math]\displaystyle{ g(x) }[/math] on the geometry of the Riemann-Stieljes integral.

When [math]\displaystyle{ g }[/math] is a step function [math]\displaystyle{ g(x) = \begin{cases} 0 & \text{if } x \leq s \\ 1 & \text{if } x \gt s \\ \end{cases} }[/math]

the fence has a rectangular "gate" of width 1 and height equal to [math]\displaystyle{ f(s) }[/math]. Thus the gate, and its projection, have area equal to [math]\displaystyle{ f(s) }[/math], the value of the Riemann-Stieljes integral.

The effect of a step function [math]\displaystyle{ g(x) }[/math] on the geometry of the Riemann-Stieljes integral.

Generalization

An important generalization is the Lebesgue–Stieltjes integral, which generalizes the Riemann–Stieltjes integral in a way analogous to how the Lebesgue integral generalizes the Riemann integral. If improper Riemann–Stieltjes integrals are allowed, then the Lebesgue integral is not strictly more general than the Riemann–Stieltjes integral.

The Riemann–Stieltjes integral also generalizes[citation needed] to the case when either the integrand ƒ or the integrator g take values in a Banach space. If g : [a,b] → X takes values in the Banach space X, then it is natural to assume that it is of strongly bounded variation, meaning that

[math]\displaystyle{ \sup \sum_i \|g(t_{i-1})-g(t_i)\|_X \lt \infty }[/math]

the supremum being taken over all finite partitions

[math]\displaystyle{ a=t_0\le t_1\le\cdots\le t_n=b }[/math]

of the interval [a,b]. This generalization plays a role in the study of semigroups, via the Laplace–Stieltjes transform.

The Itô integral extends the Riemann–Stietjes integral to encompass integrands and integrators which are stochastic processes rather than simple functions; see also stochastic calculus.

Generalized Riemann–Stieltjes integral

A slight generalization[9] is to consider in the above definition partitions P that refine another partition Pε, meaning that P arises from Pε by the addition of points, rather than from partitions with a finer mesh. Specifically, the generalized Riemann–Stieltjes integral of f with respect to g is a number A such that for every ε > 0 there exists a partition Pε such that for every partition P that refines Pε,

[math]\displaystyle{ |S(P,f,g) - A| \lt \varepsilon \, }[/math]

for every choice of points ci in [xixi+1].

This generalization exhibits the Riemann–Stieltjes integral as the Moore–Smith limit on the directed set of partitions of [ab] .[10][11]

A consequence is that with this definition, the integral [math]\displaystyle{ \int_a^b f(x)\,\mathrm{d}g(x) }[/math] can still be defined in cases where f and g have a point of discontinuity in common.

Darboux sums

The Riemann–Stieltjes integral can be efficiently handled using an appropriate generalization of Darboux sums. For a partition P and a nondecreasing function g on [ab] define the upper Darboux sum of f with respect to g by

[math]\displaystyle{ U(P,f,g) = \sum_{i=1}^n \,\, [\,g(x_i)-g(x_{i-1})\,] \,\sup_{x\in [x_{i-1},x_i]} f(x) }[/math]

and the lower sum by

[math]\displaystyle{ L(P,f,g) = \sum_{i=1}^n \,\, [\,g(x_i)-g(x_{i-1})\,] \,\inf_{x\in [x_{i-1},x_i]} f(x). }[/math]

Then the generalized Riemann–Stieltjes of f with respect to g exists if and only if, for every ε > 0, there exists a partition P such that

[math]\displaystyle{ U(P,f,g)-L(P,f,g) \lt \varepsilon. }[/math]

Furthermore, f is Riemann–Stieltjes integrable with respect to g (in the classical sense) if

[math]\displaystyle{ \lim_{\operatorname{mesh}(P)\to 0} [\,U(P,f,g)-L(P,f,g)\,] = 0.\quad }[/math][12]

Examples and special cases

Differentiable g(x)

Given a [math]\displaystyle{ g(x) }[/math] which is continuously differentiable over [math]\displaystyle{ \mathbb{R} }[/math] it can be shown that there is the equality

[math]\displaystyle{ \int_a^b f(x) \, \mathrm{d}g(x) = \int_a^b f(x)g'(x) \, \mathrm{d}x }[/math]

where the integral on the right-hand side is the standard Riemann integral, assuming that [math]\displaystyle{ f }[/math] can be integrated by the Riemann–Stieltjes integral.

More generally, the Riemann integral equals the Riemann–Stieltjes integral if [math]\displaystyle{ g }[/math] is the Lebesgue integral of its derivative; in this case [math]\displaystyle{ g }[/math] is said to be absolutely continuous.

It may be the case that [math]\displaystyle{ g }[/math] has jump discontinuities, or may have derivative zero almost everywhere while still being continuous and increasing (for example, [math]\displaystyle{ g }[/math] could be the Cantor function or “Devil's staircase”), in either of which cases the Riemann–Stieltjes integral is not captured by any expression involving derivatives of g.

Riemann integral

The standard Riemann integral is a special case of the Riemann–Stieltjes integral where [math]\displaystyle{ g(x) = x }[/math].

Rectifier

Consider the function [math]\displaystyle{ g(x) = \max\{ 0, x \} }[/math] used in the study of neural networks, called a rectified linear unit (ReLU). Then the Riemann–Stieltjes can be evaluated as

[math]\displaystyle{ \int_a^b f(x)\,\mathrm{d}g(x) = \int_{g(a)}^{g(b)}f(x)\,\mathrm{d}x }[/math]

where the integral on the right-hand side is the standard Riemann integral.

Cavalieri integration

Visualisation of the Cavaliere integral for the function [math]\displaystyle{ f(x)=(2x+8)^3 }[/math]

Cavalieri's principle can be used to calculate areas bounded by curves using Riemann–Stieltjes integrals.[13] The integration strips of Riemann integration are replaced with strips that are non-rectangular in shape. The method is to transform a "Cavaliere region" with a transformation [math]\displaystyle{ h }[/math], or to use [math]\displaystyle{ g = h^{-1} }[/math] as integrand.

For a given function [math]\displaystyle{ f(x) }[/math] on an interval [math]\displaystyle{ [a,b] }[/math], a "translational function" [math]\displaystyle{ a(y) }[/math] must intersect [math]\displaystyle{ (x,f(x )) }[/math] exactly once for any shift in the interval. A "Cavaliere region" is then bounded by [math]\displaystyle{ f(x),a(y) }[/math], the [math]\displaystyle{ x }[/math]-axis, and [math]\displaystyle{ b(y) = a(y) + (b-a) }[/math]. The area of the region is then

[math]\displaystyle{ \int_{a(y)}^{b(y)} f(x) \, dx \ = \ \int_{a'}^{b'} f(x) \, dg(x) , }[/math]

where [math]\displaystyle{ a' }[/math] and [math]\displaystyle{ b' }[/math] are the [math]\displaystyle{ x }[/math]-values where [math]\displaystyle{ a(y) }[/math] and [math]\displaystyle{ b(y) }[/math] intersect [math]\displaystyle{ f(x) }[/math].

Notes

  1. Stieltjes (1894), pp. 68–71.
  2. Hille & Phillips (1974), §3.3.
  3. Young (1936).
  4. See Riesz & Sz. Nagy (1990) for details.
  5. Johnsonbaugh & Pfaffenberger (2010), p. 219.
  6. Rudin (1964), pp. 121–122.
  7. Kolmogorov & Fomin (1975), p. 368.
  8. Bullock (1988)
  9. Introduced by Pollard (1920) and now standard in analysis.
  10. McShane (1952).
  11. Hildebrandt (1938) calls it the Pollard–Moore–Stieltjes integral.
  12. Graves (1946), Chap. XII, §3.
  13. T. L. Grobler, E. R. Ackermann, A. J. van Zyl & J. C. Olivier Cavaliere integration from Council for Scientific and Industrial Research

References